chris hay

Multi-Head vs Grouped Query Attention. Claude AI, Llama-3, Gemma are choosing speed over quality?

What is Retrieval Augmented Generation (RAG) and JinaAI?

NVIDIA's Nemotron-4's is totally insane for synthetic data generation

Inside the LLM: Visualizing the Embeddings Layer of Mistral-7B and Gemma-2B

Getting Started with ReAct AI agents work using langchain

Is JavaScriptCore (JSC) really the reason bun.js is so fast? Is V8 that slow? Is JSC the fastest?

HuggingFace Fundamentals with LLM's such as TInyLlama and Mistral 7B

The future of AI agents is WebAssembly (get started now)

AHORA DICES QUE ME AMAS BY CHRIS OTERO

How the Gemma/Gemini Tokenizer Works - Gemma/Gemini vs GPT-4 vs Mistral

superduperdb supercharges your database for AI

Creating ReAct AI Agents with Mistral-7B/Mixtral and Ollama using Recipes I Chris Hay

Understanding STaR and how it powers Claude and Gemini/Gemma 2 (and maybe OpenAI Q* or Strawberry)

No te pierdas al hijo de Elsa Pataky y Chris Hemsworth hablando español

Chris Hay Goal - Huddersfield vs Swindon 98/99

Chris Hayes Breaks Down the $95 Billion Foreign Aid Package for Ukraine and Israel

Multi-Head Attention vs Group Query Attention in AI Models

Mistral-7B: Text Classification Thoroughbred or Doddling Donkey?

All the Reasons Why

Newly shed Woma Python looking pretty - with Chris Hay

fine tuning llama-2 to code

Chris Hay

Chris Hay, Managing Director, Centaur Robotics

Why NVidia's Nemotron is not for chat usage